Measurement of Similarity between Nouns

نویسنده

  • Kenneth E. Harper
چکیده

A study was r~ade of tile degree of similarity between pairs of Russian nouns, as expressed by their tendency to occur in sentences with identical ~,,ords in identical syntactic relationships. A similarity matrix was prepared for forty nouns; for each pair of nouns the number of shared (i) adjective dependents, (ii) noun dependents, and (iii) noun governors was automatically retrieved from machine-processed text. The similarity coefficient for each pair ~;as determined as the ratio of the total of such shared ~'ords to the product of the frequencies of the two nouns in the text. The 78~ pairs were ranked according to this coefficient. The text comprised 12(1,~00 running words of physics text processed at The RAND Corporation; the frequencies of occurrence of the forty nouns in this text ranged from 42 to 328. The results suggest that the sample of text is of sufficient size to be useful for the intended purpose. Many noun pairs with similar properties (synonymy, antonym),, derivation from distributionally similar verbs, etc.) are characterized by high similarity coefficients; the converse is not observed. The relevance of various syntactic relationships as criteria for meas~rement is discussed.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Developing a Semantic Similarity Judgment Test for Persian Action Verbs and Non-action Nouns in Patients With Brain Injury and Determining its Content Validity

Objective: Brain trauma evidences suggest that the two grammatical categories of noun and verb are processed in different regions of the brain due to differences in the complexity of grammatical and semantic information processing. Studies have shown that the verbs belonging to different semantic categories lead to neural activity in different areas of the brain, and action verb processing is r...

متن کامل

Extraction of Associative Attributes from Nouns and Quantitative Expression of Prototype Concept

One of the purposes of this research is to formalize similarity among nouns by using attributes associated from the nouns, and then using the similarity, to formalize prototypes of categories. The other purpose is to extract features of nouns by using adjectives or adjective-like words obtained by the association experiments and to formalize importance of the nouns with the words. We constructe...

متن کامل

Pii: S0010-0277(99)00034-7

This paper examines children's early noun vocabularies and their interpretations of names for solid and non-solid things. Previous research in this area assumes that ontology, category organization and syntax correspond in the nouns children learn early such that categories of solid things are organized by shape similarity and named with count nouns and categories of non-solid things are organi...

متن کامل

Two Methods of Evaluation of Semantic Similarity of Nouns Based on Their Modifier Sets

Two methods of evaluation of semantic similarity/dissimilarity of English nouns are proposed based on their modifier sets taken from Oxford Collocation Dictionary for Student of English. The first method measures similarity by the portion of modifiers commonly applicable to both nouns under evaluation. The second method measures dissimilarity by the change of the mean value of cohesion between ...

متن کامل

Construction of an Objective Hierarchy of Abstract Concepts via Directional Similarity

The method of organization of word meanings is a crucial issue with lexical databases. Our purpose in this research is to extract word hierarchies from corpora automatically. Our initial task to this end is to determine adjective hyperonyms. In order to find adjective hyperonyms, we utilize abstract nouns. We constructed linguistic data by extracting semantic relations between abstract nouns an...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1965